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Abstract 

We propose a new set of stylized facts quantifying the structure of financial markets. The key idea is to 
f-H ' study the combined structure of both investment strategies and prices in order to open a qualitatively 

00 . new level of understanding of financial and economic markets. We study the detailed order flow on the 

Shenzhen Stock Exchange of China for the whole year of 2003. This enormous dataset allows us to 
compare (i) a closed national market ( A-shares) with an international market (B-shares) , (ii) individuals 
and institutions and (iii) real investors to random strategies with respect to timing that share otherwise 
all other characteristics. We find that more trading results in smaller net return due to trading frictions. 
We unveiled quantitative power laws with non-trivial exponents, that quantify the deterioration of per- 
formance with frequency and with holding period of the strategies used by investors. Random strategies 
are found to perform much better than real ones, both for winners and losers. Surprising large arbitrage 
opportunities exist, especially when using zero-intelligence strategies. This is a diagnostic of possible 
inefficiencies of these financial markets. 
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^ ; Introduction 

Nothing in biology makes sense except in the light of evolution. This famous sentence by Theodosius 
Dobzhanski [T] captures the fact that the extraordinary diversity of life can only be understood by 
combining the mechanisms of genetic evolution with historical environmental threads. Consider now the 
, common wisdom that, as a result of accumulated technological and financial innovations, societal and 

economic networks have never been more complex and that this complexity has reached unmanageable 
levels within the current understanding and methodologies PHI]- Moreover, this complexity is often 
accused to be at the core origin of the financial crisis that started in 2007, of the ensuing so-called 
Great Recession and of the continuing woes of major economies worldwide. In the spirit of Dobzhanski's 
statement, we here propose to investigate the concept that nothing in the complexity of financial markets 
make sense except in the light of the evolution of investors' strategies and of their mutual feedback 
loops. Rather than fixating on so-called stylized facts [5], we propose to study the combined evolution 
of financial patterns with the ecology of investors feeding on them and creating them. This is analogous 
to the importance of understanding the evolution of the fabric of social networks to make sense of the 
dynamics of human societies, the growth and organization of fault networks to account for the spatio- 
temporal organization of earthquakes, the structure of the brain and its plasticity to describe neural 
excitations and make progress on treating epileptic seizures, and so on. Similarly, the occurrence and 
severity of the financial crisis is best understood from the perspective of the accumulation of at least 
five bubbles over the last twenty years [5] associated with a climate of complacency everywhere and the 
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illusion of the "great moderation" [7] . 

The view that financial markets can be better understood as adaptive ecologies of co-evolving investors 
is not new. It has been explored in agent-based models [SHI 3] and articulated in the so-called "adaptive 
markets hypothesis" [14j . Here, our contribution is to provide novel empirical evidence based on the 
analysis of a unique datasct. 

The logic of our approach is based on the following points. 

1. Several important studies have shown that efficient allocation can result from the aggregation of 
decisions made by irrational or zero- intelligent agents under constraints j!5H17) . 

2. Many studies have repeatedly documented that most investors underperform the global market as 
well as simple buy-and-hold strategies [T5H2"0] , with only very few exceptions [H] . 

3. The structure of markets results from the aggregate impact of investors. 

4. Here, we show that random strategies are as a rule significantly better than even the best investors. 

5. We characterize the statistical properties of trading frequency and holding periods of investors and 
quantify their impact on performance. 

Of course, point 4 is self-fulfilling in the sense that a random strategy is applied to a system made 
by supposedly optimizing human beings who shape the market. If most strategics were random, the 
conclusions are likely to be quite different. Points 3-5 together imply that there are untapped investment 
and arbitrage opportunities, that very few investors actually profit from. This is very surprising given 
the ease with which zero-intelligence traders over-perform. The quantitative characterization of the 
performances of investors as a function of a few key observable characteristics that is presented below can 
be used by future prospective investors to improve their strategies. Of course, as more investors become 
wiser, the market characteristics will evolve in a way similar to the first-entry games in which random 
strategies are found ultimately to dominate |22j . Our main point is that the characterization of financial 
markets requires understanding the ecology of strategies and their characteristics and how they interact 
together to shape the very patterns they exploit. Arguably, this will provide for really more efficient and 
robust financial markets, designed to avoid future systemic crises. 

In this work, we perform a statistical analysis of the performance of all the investors trading 32 A- 
share stocks and 11 B-share stocks on the Shenzhen Stock Exchange of China in 2003. This market offers 
a unique opportunity to compare (i) a closed national market (A-shares) with an international market 
(B-shares), (ii) individuals and institutions and (iii) real investors to random strategies with respect to 
timing that share otherwise all other characteristics. The analysis is conducted separately for A-shares 
and B-shares. The database contains the information of each order including (i) the masked ID of the 
trader, (ii) whether he is an individual or institution, (iii) the direction, (iv) the price, (v) the size of the 
order, and (vi) the time stamps accurate to 0.01 second [23]. The evolution of the A-share index and 
the B-share index is shown in Fig. SI. Interestingly, the indices are found to be outperformed by the 
1 /N portfolio strategy [53] . We find that the net return of A-share individual investors is negative and 
independent of the trading frequency, while that of A-share institutional investors, B-share individual 
investors and B-share institutional investors decreases with increasing trading frequency. In addition, 
the net return decreases for winners and increases for losers when the trading frequency increases. Wc 
also find that random trading performs better for all individuals and institutions and for all winners. We 
show that the performance of investors exhibit non-trivial power law dependence as a function of trading 
frequency and holding periods. 

The invisible hand with zero-intelligence agents 

Since Adam Smith's famous "invisible hand" description of economic and financial markets as self- 
regulating systems, economics has been dominated by the paradigm of rational utility maximizing agents. 
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Given restrictive conditions, the agents' collective actions are found in theory to lead to stable general 
equilibrium points that are characterized by optimal allocation of resources. However, starting with H. 
Simon, and expanding with the work of D. Kahneman and A. Tversky as well as many other scientists, the 
severe limitations of human cognition and the many biases in real people's decisions have been pointed 
out. These limitations and biases a priori cast doubts on the relevance of rational utility theory. In 
reply, many studies have shown that irrational households can lead in aggregate to rational markets. In 
particular, Gode and Sunder |15j used "zero-intelligence" computer agents (who do not seek or maximize 
profits, do not observe, remember, or learn) to simulate market transactions in a double auction. They 
found that a population of such agents, subjected to budget constraint, produced results that closely 
mirrored the allocation efficiency of a simultaneous experimental human exchange. Studying prediction 
markets, Othman |16j confirmed recently that prices that replicate the findings of empirical market stud- 
ies can emerge from a market populated by inhuman zero-intelligence agents with diffuse beliefs. Farmer 
ct al. [T7] developed a model of zero-intelligent agents that explains a large part of the cross-sectional 
properties of stocks traded in continuous double auction markets. They suggest that constraints imposed 
by market institutions may at times dominate strategic agent behavior, so that random agents with 
constraints perform on the whole as well as their more human siblings. 

Underperformance and the illusion of control 

In both single-player and multiplayer Parrondo games, two or more losing games when alternated pe- 
riodically or randomly yield a net winning outcome |25H28| . When an optimization rule is introduced, 
the Parrondo games produce degraded rather than enhanced returns [22j[29]. This "illusion of control" 
phenomenon is present in other agent-based models whose design is inspired by stock markets |30|. The 
convention wisdom states that institutions markedly outperform individuals because they are more in- 
formed [31 j - However, the performance of both professionals and laymen is often documented to be worse 
than chance 18 20,32 . Even worse, there is evidence showing that analysts' stock recommendation 
records are intentionally rewritten to a large extent [33] . Indeed, the performance of claimed success- 
ful strategies should be tested based on the method of random strategies, that are designed to remove 
survival and look-ahead biases [34] . 

The phenomenon of "Illusion of control" is one possible form of overconfidence. Overconfidence of 
stock market participants is expected to cause investors to trade more [35|l36j , which has been confirmed 
at the market and individual equity level [37] and at the individual level [38H40] . In addition, there is 
evidence that the higher the frequency of trading, the poorer is the performance [3T1|4"TH4"3"] . 

Materials and Methods 

Data sets 

The Shenzhen Stock Exchange (SZSE) was established on December 1, 1990 and started its operations 
on July 3, 1991. It contains two independent markets, A-share market and B-share market. The former 
is composed of common stocks which are issued by mainland Chinese companies. It is opened only to 
domestic investors, and traded in CNY. The latter is also issued by mainland Chinese companies, while 
it is traded in Hong Kong dollar (HKD). It was restricted to foreign investors before February 19, 2001, 
and since then it has been opened to Chinese investors as well. At the end of 2003, there were 491 
A-share stocks and 57 B-share stocks listed on the SZSE. In the year 2003, the opening call auction is 
held between 9:15 am and 9:25 am, followed by the cooling periods from 9:25 am to 9:30 am, and the 
continuous auction operating from 9:30 am to 11:30 am and 13:00 pm to 15:00 pm. 

Our analysis is based on a database recording the order flows of 43 liquid stocks extracted from the 
A-sharc market and the B-share market on the SZSE in the whole year of 2003 when the close call 
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auction was adopted in the opening procedure. The trading system did not show any information about 
the order flows, and traders submitted orders only according to the closing price of the last trading day. 
The database contains the price, size and associated time of each submitted order recorded in the opening 
call with the time stamps accurate to 0.01 second. Figure Q] shows the evolution of the two indexes. 
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Figure 1. Daily evolution of Shenzhen Component indexes for A-shares and B-shares in 2003 and the 
performance of their 1/N portfolios, respectively. 




Method 



Assume that there are / investors, each of them labeled i, and there are S stocks in the database under 
investigation, each labeled s. For investor i and stock s, we can construct a sequence of buy/sell activities, 
denoted A(i, s): 

vi,v 2 , ■■■ ,vj,--- ,v Js 
A(i,s)= pi,p 2 , ■ ■ ■ ,Pj, ■ ■ ■ ,P,j B , (1) 
ti,t%, ■ • • , tj, • • ■ , tj a 

which means that investor i sells Vj shares of stock s with price pj at time tj. If vj < 0, then "sell 
Vj shares" means "buys — Vj shares". Obviously, there are non-zero Vj among all entries. We use the 
convention that a positive sign corresponds to selling. 

In order to construct A(i,s), we need to reconstruct the order book. Assume that investor i places 
a limit order of size V at time t, which is executed later by k other effective market orders with 



sizes Vi, V2, ■ ■ ■ ,Vk with prices Pi, P2, • • ■ ,Pk at times Ti,T 2 , 
(vj , pj ,tj), where 

k 



, Tk . Then, we record only one entry 



Pj 



1 

m—1 



V P 



= V, U=T k 



(2) 



This is very important in the calculation of transaction costs, defined soon. More operations are needed 
for A(i, s) in order to make sure that 

Js 

There are at least two cases to consider. First, if there are several sell transactions without any preceding 
buy transactions, these sells should not be included in A(i, s). Second, at the end of year 2003, if investor 
i holds some shares of stock s, we added a new entry by including a virtual transaction selling all his 
shares. 



5 



For the j-th transaction (or cquivalently the j-th entry), the trading volume is PjVj. According to the 
Shenzhen Stock Exchange Trading Rules released in 2001. the transaction cost is determined as follows: 

Cj = ma,x{\p. j v j \{b i + e + /), 5} + \pjVj\d, (4) 

The four terms in Eq. ((4]) are the following: (i) Brokerage 6j, which should be less than 0.3%; (ii) Exchange 
fee e = 0.01475% for A-shares and ej = 0.0301% for B-shares for both buy and sell sides; (iii) Supervision 
fee / = 0.004% for both buy and sell sides; and (iv) Stamp duty d = 0.1% for sellers only. The sum of 
bi + e + / should be less than 0.3% with a minimum of 5 CNY for A-shares and 5 HKD for B-shares for 
both buy and sell sides. Note that bi is i-specific and independent of stock s. 

Therefore, the total invested capital (the money that investor i spent to buy stock s) is Bi S = 
— ^2 V <0 VjPj and the transaction cost of investor i buying stock s is C^ s = <o c j- The total capital 
obtained by selling all the shares of stock s is Sj )S = Ylv >o v jPj ano - ^ ne transaction cost of investor i 
selling stock s is Cf s = 53„ >o c i- The total transaction cost of investor i in his investment of stock s is 

Cj,s = C, b s + C? s , (5) 

The total earning is 

Ei.s = Si :S — Bi, s — Ci^ s + D iyS , (6) 

where Di tS is cash dividend received by agent i from stock s over the one period. The portfolio return of 
investor i can be calculated as follows 

Ri = > 



=x>7 (e*m+e<$.) 

s=l / \s=l s=l / 



The number of transactions (frequency of trading) is the sum of all J values of investor i 

s 

J,=5> s . (8) 



Results 

Basic statistics 

In our database, there are 2,330,093 A-share investors with 2,315,664 individuals and 135,086 B-sharc 
investors with 88,779 distinct individuals. It is found that the proportion of institutional investors is 
much higher in the B-share market (34.28%) than in the A-share market (0.62%). For each investor i, we 
calculate his portfolio return Ri. For A-share investors, 51.95% individuals and 68.50% institutions are 
net winners above the zero benchmark. For B-share investors, 85.76% individuals and 93.83% institutions 
are winners above the zero benchmark. To have a full understanding of how many of the traders are 
really performing, the box plots of returns for each type of investors in each market are given in Fig. [2] 
Some interesting observations are obtained. 

• The proportion of winning traders in the B-share market is much higher than in the A-share market. 
This may be associated with the fact that the B-share index gained a much higher annual return 
than the A-share index (see Fig. SI). Passive or even under-performing agents perform better on 
an absolue basis, the higher the upward trend of the underlying market. 

• In both markets, the winning proportion of institutional traders is much higher than retail traders, 
which is consistent with recent results found for the Taiwan market |31j . 

• In each market, individual investors may gain higher returns than institutions or incur greater losses 
(see Fig.©. 
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Figure 2. Box plots of the returns in four classes: Individual A-share investors, institutional A-share 
investors, individual B-share investors, and institutional B-share investors. For each box plot, the 
minimum, lower quartile, median, upper quartile, and maximum of each class of returns are given. 



Trading frequency and return 

Figure [3] shows the dependence of the average returns R as a function of the trading frequency J for 
A-share individuals (a), A-share institutions (b), B-share individuals (c), and B-share institutions (d), 
respectively. One can observe that the return is statistically independent of the trading frequency for 
A-share individuals while, in the three other cases, R decreases systematically with J, indicating that 
trading is hazardous to investors' wealth not only for individuals but also for institutions |42j . Comparing 
the returns with the same trading frequency, institutions out-perform individuals. 

We also investigate the dependence of the net return as a function of trading frequency for two 
categories of investors, the winners and the losers. Winners (respectively losers) are defined as those 
having a positive (respectively negative) return. The classification is thus performed on an absolute (and 
not relative) basis. Figure [3] shows that return decreases with trading frequency for winners and increases 
for losers. The enhanced performance of losers by increasing trading frequency cannot be explained by a 
learned ovcrconhdcncc bias as described in Refs. |351l36] . 

Figure [3] also presents the average returns that random trading would yield in these markets. Random 
strategies are generated by considering each investor individually in turn, choosing random times for their 
trades while otherwise keeping fixed all other characteristics such as his number of transactions (trading 
frequency) and the trade sizes on each stock. Specifically, in Eq. ([1]), for a given investor i and a stock s, 
the variables J a and Vj,j = 1, • ■ • , J s are unchanged, while the times tj,j = 1, • ■ ■ , J s are replaced by a 
randomly chosen time sequence. As a result, the prices pj are also changed. This is done 2000 times for 
each investor, generating overall a very large number synthetic outputs contributed over all the investors 
in our database. For a given frequency J, we sort again these many outputs into two classes: (i) the 
winners are the random strategies with a positive return; (ii) the losers are the random strategies with 
a negative return. We then compute separately the average returns (and their standard deviation) of 
the winning and of the losing random strategies, as well as the overall average return of these random 
strategies. Note that this construction of random strategies tests specifically the skills of investors with 
respect to timing, since all the other characteristics are kept otherwise identical [34]. According to Fig. 2, 
the aggregate net return of random trading (black lines) is higher than that of real trading (black circles) 
with the same trading frequency in every case. More impressively, the aggregate net returns of A-sharc 
individuals are negative, while the net returns using random trading strategy are positive. Two closely 
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Figure 3. Performance comparison of strategic trading (real data) and random trading using the 
average values of return R versus trading frequency J. We exclude the sell transactions without any 
preceding matching buys. The simulations for the random strategies are repeated for 2000 times. We 
show the results for individuals in A-share (a), institutions in A-share (b), individuals in B-share (c) 
and institutions in B-share (d), respectively. In each plot, the colorful symbols (o, A, V) correspond to 
strategic trading, the continuous lines correspond to random trading, and the dashed line indicates the 
base line of zero return (R = 0). 

related conclusions can be drawn: (i) real trading is not random but strategic; (ii) the performance of 
strategic trading is worse than random trading. For the winners, random trading also induces higher 
return than real strategic trading in all four cases. For losers in the A-share market, random trading 
performs slightly worse. For losers in the B-share market, the random trading and real trading yield 
almost identical net returns. 

Holding time and return 

Figure [4] shows the average return R as a function of the average holding time At for A-share individuals 
(a), A-share institutions (b), B-share individuals (c) and B-share institutions (d), respectively. This 
figure is different from Fig. [3] because holding time is not simply the inverse of trading frequency. Indeed, 
the total holding time J At is not constant for different investors. In general, the aggregate net return 
increases with average holding time. This is consistent with the conventional wisdom that the buy-and- 
hold strategy outperforms most other strategies (see Fig. SI for the performance of the buy-and-hold 
strategy of the 1/N portfolio). For winners, the return is large when the holding time is long. For losers, 
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R decreases with respect to At in the A-share market and varies slightly in the B-share market. The solid 
lines with error bars are the average simulation results of random trading. For large At and B-shares, we 
still see that random trading performs better. 




At At 

Figure 4. Average returns R versus average holding time for individuals in A-share (a), institutions in 
A-share (b), individuals in B-share (c) and institutions in B-share (d), respectively. The symbols 
present the average values over all investors (o), as well as investors who earn positive return (A) and 
negative return (v). The dashed line delineates the benchmark in absolute terms of zero return. The 
solid lines with error bars are the average simulation results of random trading. The insets in panels (c) 
and (d) are magnifications of the curves for small values of At. 



Quantitative relations linking return to trading frequency and holding period 

The winners or losers (individuals or institutions in a market) are sorted according to their trading 
frequencies. The averages of the returns and holding times of each group of investors are calculated. We 
plot the magnitude of the average returns for winners and for losers as a function of the trading frequency 
in double logarithmic coordinates in the first column of Fig. [5] and observe a power law relationship 

R ~ J- a , (9) 

where a = 0.31 ± 0.01 for A-share individual winners, a = 0.38 ± 0.01 for A-share individual losers, 
a = 0.20 ± 0.04 for A-share institutional winners, a = 0.11 ± 0.04 for A-sharc institutional losers, 
0.20 ± 0.01 for B-share individual winners, 0.39 ± 0.02 for B-share individual losers, 0.24 ± 0.02 for 
B-share institutional winners, and 0.46 ± 0.09 for B-share institutional losers. 
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Figure 5. Power-law relationships between three variables. The first column (a,d,g,j) shows the 
dependence between the magnitude of the average return R and the trading frequency J. The second 
column (b,e,h,k) shows the dependence between the magnitude of the average return R and the holding 
time At. The third column (c,f,i,l) shows the dependence between the magnitude of the average return 
R and the holding time At and the trading frequency J. The four rows are for A-share individuals, 
A-sharc institutions, B-share individuals, and B-sharc institutions, respectively. 



Similarly, the magnitude of the average return R scales with respect to the average holding time At 
as a power law 

R ~ At&, (10) 
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and the average holding time At also scales with respect to the average trading frequency J as a power 
law 

At-J-T, (11) 

as illustrated in the second and third columns of Fig. [5l The power laws are statistically more significant 
for individual investors than institutional investors because there are many more individuals (about 
99.38%) in the A-share market. The estimated exponents are listed in Table [TJ For each power-law 
relationship, the exponents for winners and losers are approximately equal to each other, that is, 

-^winner, investor, market ~ -^loscr, investor, market 5 

(12) 

where E — a, /3, or 7, "investor" could be individuals or institutions, and "market" could be A-shares or 
B-shares. 

Combining Eqs. (j^lllip. we obtain an equation relating the three power-law exponents 

a = /? 7) (13) 

which is validated in Table [T] 

Table 1. Estimated exponents of the power-law relationships 
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Discussion 



The original incentive of this study was to provide novel comparative characterizations of financial markets 
that are based on the realized performances of strategies implemented by investors. The Shenzhen Stock 
Exchange of China offers a unique opportunity to compare (i) a closed national market (A-shares) with 
an international market (B-Shares), (ii) individuals and institutions and (iii) real investors to random 
strategies with respect to timing that share otherwise all other characteristics. 

The first robust result is that more trading results in smaller net return due to trading frictions. This 
is true for both individual and institutional investors in China's B-share market. However, the net return 
of individual investors in the A-share market is independent of the trading frequency, which is different 
from other markets [3T1I4TH43] . For individual or institutional winners, this result holds again. We 
unveiled quantitative laws showing how the deterioration of performance scales with frequency and with 
holding period. Naively, we could have expected that the performance is simply inversely proportional 
to the trading frequency, if transaction costs was the only contribution. But here, we find non-trivial 
exponents, which reveal the complexity of the market price structure as the investors strategically adapt 
their investments. These results provide a new set of stylized facts that characterize the structure of the 
price patterns. In other words, the properties of the returns obtained by different investors provide a 
kind of "spectroscopy" of the prices. 

We also found that the return of real trading is significantly and robustly worse than random trading. 
As a consequence, we can conclude that investors do try to develop opportunistic strategies, but zero 
intelligence strategies outperform them in stock trading. Certainly, this conclusion does not deny the 
possibility that some investors do perform better than random trading. Therefore, we can use the 
strategy performance as a gauge or an instrument to characterize the market structure, in addition to 
the its statistical properties often referred to as the stylized facts. To the best of our knowledge, this idea 
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is novel. It reflects the natural consequence that the aggregation of strategics make the stock market 
structure what it is, and vice- versa the later influences and co-evolve with the ecology of strategies |11) . 
The strategics implemented by investors are not only probing the prices but also influencing the prices 
so that they are both cameras and engines [55]. We believe that the study of the combined structure of 
both strategics and prices will open a qualitatively new level of understanding of financial and economic 
markets. 
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